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Abstract 



We have studied the potential of the CDF and D0 experiments to dis- 
cover a low-mass Standard Model Higgs boson, during Run II, via the pro- 
cesses pp — > WH ivbb, pp — > ZH £~^i~bb and pp — > ZH — > i/i^bb. We 
show that a multivariate analysis using neural networks, that exploits all the 
information contained within a set of event variables, leads to a significant 
reduction, with respect to any equivalent conventional analysis, in the inte- 
grated luminosity required to find a Standard Model Higgs boson in the mass 
range 90 GeV/c^ < Mh < 130 GeV/c^. The luminosity reduction is suffi- 
cient to bring the discovery of the Higgs boson within reach of the Fermilab 
Tevatron experiments, given the anticipated integrated luminosities of Run 
II, whose scope has recently been expanded. 

PACS Numbers: 14.80.Bn, 13.85.Qk 
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I. INTRODUCTION 



The success of the Standard Model (SM) of particle physics, which provides an accurate 
description of almost all particle phenomena observed so far Hlj-^, has been spectacular. 
However, one crucial aspect of it remains mysterious: the fundamental mechanism that 
underlies electro-weak symmetry breaking (EWSB) and the origin of fermion mass. Elu- 
cidating the nature of EWSB is the next major challenge of particle physics and will be 
the focus of upcoming experiments at the Fermilab Tevatron and the CERN Large Hadron 
Collider (LHC) during the early years of the twenty-first century. 

In many theories, EWSB occurs through the interaction of one or more doublets of scalar 
(Higgs) fields with the initially massless fields of the theory. An important goal over the next 
decade is to determine whether or not, in broad outline, this picture of EWSB is correct. In 
the Standard Model there is a single scalar doublet. The EWSB endows the weak bosons 
{W^,Z) with masses and gives rise to a single physical neutral scalar particle called the 
Higgs boson (Hsm). In minimal supersymmetric (SUSY) extensions of the SM, two Higgs 
doublets are required resulting in five physical Higgs bosons: two neutral CP-even scalars 
{h,H), a neutral CP-odd pseudo-scalar (A) and two charged scalars {H"^). Non-minimal 
SUSY theories generally posit more than two scalar doublets. 

Given this picture of EWSB, the direct and indirect measurements of the top quark and 
W boson masses constrain the mass of the SM Higgs boson (Mngj^j), as indicated in Fig |l|. 
A global fit to all electroweak precision data, including the top quark mass, gives a central 
value of Mhsm = 107145 GeV/c^ and a 95% confidence level upper limit of 225 GeV/c^ 
||l|]. In broad classes of SUSY theories the mass of the lightest CP-even neutral Higgs 
boson, h, is constrained to be less than 150 GeV/c^ [^. In the minimal supersymmetric 
SM (MSSM), the upper bound on Mh is lowered to about 130 GeV/c^ This bound 

is reasonably robust with respect to changes in the parameters of the theory. Furthermore, 
in the limit of large pseudo-scalar Higgs boson mass, M4 >> Mz, where Mz is the mass 
of the Z boson, the properties of the lightest MSSM Higgs boson h are indistinguishable 
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from those of the SM Higgs boson, Hsm- These intriguing indications of a low- mass Higgs 
boson motivate the study of strategies that maximize the potential for its discovery at the 
upgraded Tevatron 0. This paper describes a strategy that achieves this goal. 

The current 95% CL lower limit on the Higgs boson mass, from the CERN e^e~ collider 
LEP, is 107.9 GeV/c^ @] and is expected to reach close to 114 GeV/c^ 0] in the near 
future. We have therefore studied the mass range 90 GeV/c^ < Mh < 130 GeV/c^, where 
if, hereafter, denotes the SM Higgs boson, Hsm- The cross sections for SM Higgs boson 
production at the Fermilab Tevatron are shown in Fig ^ At a/s = 2 TeV, the dominant 
process for the production of Higgs bosons in pp collisions is gg H . The Higgs boson 
decays to a hh pair about 85% of the time. Unfortunately, even with maximally efficient 
6-tagging this channel is swamped by QCD di-jet production. The more promising channels 
are pp WH iubb, pp ZH £^i^ hh and pp ZH uuhb, which are the ones we 
have studied. 

In WH events the lepton can be lost because of deficiencies in the detector or the event 
reconstruction or the lepton energy being below the selection threshold. For such events the 
reconstructed final state would be indistinguishable from that arising from the process pp — *■ 
ZH uubb. We have therefore studied these processes in terms of the channels: single 
lepton {£ + ]/]rp + bb from WH), di-lepton {i^i^bb from ZH) and missing transverse energy 
{]^rp + bb from ZH and WH), where I^j- denotes the missing transverse energy from all 
sources, including neutrinos. For each of these channels, we have carried out a comparative 
study of multivariate and conventional analyses of these channels in which we compare signal 
significance and the integrated luminosity needed for discovery. 

The paper is organized as follows: In Sec. || we describe our strategy in general terms. 
Sections ^ and |V|, respectively, describe our analyses of the single lepton, di-lepton and 
missing transverse energy channels. Our conclusions are given in Sec. [V^. 
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II. OPTIMAL EVENT SELECTION 



In conventional analyses a cut is applied to each event variable, usually one variable 
at a time, after a visual examination of the signal and background distributions. Although 
analyses done this way are sometimes described as "optimized," in practice, unless the signal 
and background distributions are well separated, the traditional procedure for choosing cuts 
is rarely optimal in the sense of minimizing the probability to mis-classify events. Since we 
wish to maximize the chance of discovering the Higgs boson we need to achieve the optimal 
separation between signal and background, while maximizing the signal significance. Given 
any set of event variables, optimal separation can always be achieved if one treats the 
variables in a fully multivariate manner. 

Given a set of event variables, it is useful to construct the discriminant function D given 

by 

s(x) + 0(x) 

where x is the vector of variables that characterize the events and s(x) and &(x), respec- 
tively, are the n— dimensional probability densities describing the signal and background 
distributions. The discriminant function D = r/(l + r) is related to the Bayes discrimi- 
nant function which is proportional to the likelihood ratio r = s(x)/6(x). Working with 
D, instead of directly with x, brings two important advantages: 1) it reduces a difficult 
n— dimensional optimization problem to a trivial one in a single dimension and 2) a cut on 
D can be shown to be optimal in the sense defined above. 

There is, however, a practical difficulty in calculating the discriminant D. We usually 
do not have analytical expressions for the distributions s(x) and &(x). What is normally 
available are large discrete sets of points Xj, generated by Monte Carlo simulations. For- 
tunately, however, there are several methods available to approximate the discriminant D 
from a set of points Xj, the most convenient of which uses feed-forward neural networks. 
Neural networks are ideal in this regard because they approximate D directly [pJ] , p^ . 
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Many neural network packages are available, any one of which can be used to calculate D. 
We have used the JETNET package ||T^ to train three-layer (that is, input, hidden and out- 
put) feed-forward neural networks (NN). The training was done using the back-propagation 
algorithm, with the target output for the signal set to one and that for the background 
set to zero. In this paper we use the terms "neural network output" and "discriminant" 
interchangeably. However, the distinction between the exact discriminant as we have 
defined it above, and the network output, which provides an estimate of should be borne 
in mind. 

III. SINGLE LEPTON CHANNEL 

We have considered final states with a high pt electron (e) or muon (/i) and a neutrino 
from W decay and a hh pair from the decay of the Higgs boson. The WH events were 
simulated using the PYTHIA program ]14[ for Higgs boson masses of = 90, 100, 110, 
120 and 130 GeV/c^. In Table 1 we list the cross section x branching ratio (BR) we have 
used for the process pp WH — > iubb where i = e, fi, r. 

The processes pp — > Wbb, pp WZ, pp — > ti, single top production — pp — > W* tb 
and pp Wg —>■ tqb, which have the same signature, iubb, as the signal, are the most 
important sources of background. They have all been included in our study. The Wbb sample 
was generated using CompHEP [jT3|, a parton level Monte Carlo program based on exact 
leading order (LO) matrix elements. The parton fragmentation was done using PYTHIA. 
The single top, ti and WZ events were simulated using PYTHIA. To generate the s-channel 
process, W* tb, we forced the W to be produced off-shell, with > rrit + rub, and 
then selected the final state in which W tb. The cross sections used for the background 
processes are given in Table I. 

To model the expected response of the CDF and D0 Run II detectors at Fermilab we 
used the SHW program |jl6| , which provides a fast (approximate) simulation of the trigger. 



tracking, calorimeter clustering, event reconstruction and 5-tagging. The SHW simulation 
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predicts a di-jet mass resolution of about 14% at Mh = 100 GeV/c^, varying only slightly 
over the mass range of interest. However, to allow for comparisons with the other WH and 
ZH studies at the Physics at Run II SUSY/Higgs workshop some of which do not use 
SHW, we have re-scaled the di-jet mass variables for all signal and background events so 
that the resolution is 10% at each Higgs boson mass. The consensus of Run II workshop is 
that such a mass resolution can be achieved, albeit with considerable effort. 

In principle, multivariate methods can be applied at all stages of an analysis. However, 
in practice, experimental considerations, such as trigger thresholds and the need to restrict 
data to the phase space in which the detector response is well understood, dictate a set of 
loose cuts on the event variables. These cuts define a base sample of events. In our case, the 
base sample was determined by the following cuts: 

• the transverse momentum of the isolated lepton > 15 GeV/c 

• the pseudo-rapidity of the lepton \r](\ < 2 

• the missing transverse energy in the event > 20 GeV 

• two or more jets in the event with > 10 GeV and |?7jei| < 2. 

Since the Higgs decays into a bb pair we impose the requirement that two jets be 6-tagged. 
This of course does little to reduce the dominant Wbb background, due to the presence of 
the bb pair, but it becomes powerful when the invariant mass, Mf^i, of the 6-tagged jets is 
used as an event variable. The di-jet mass distributions for the signal is expected to peak at 
the Higgs boson mass, whereas one expects a broad distribution for the background, with 
the exception of the WZ background which peaks at the Z boson mass. 



One of the 6-tags was required to be tight and the other loose |TB]. A tight 6-tag is defined 



by an algorithm that uses the silicon vertex detector, while a loose 6-tag is defined by the 



same algorithm with looser cuts or by a soft lepton tag |T^. The mean double 6-tagging 
efficiency in SHW is about 45%. 
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We searched for variables that discriminate between the signal and the backgrounds and 
arrived at the following set: 

• E^, - transverse energies of the 6-tagged jets 

• Mf^i - invariant mass of the 6-tagged jets 

• Ht - sum of the transverse energies of all selected jets 

• Ej, - transverse energy of the lepton 

• r]e - pseudo-rapidity of the lepton 

• - missing transverse energy 

• S - sphericity {S = ^{Qi + Q2) where Qi are the eigenvalues obtained by diagonalizing 
the normalized momentum tensor Mab = J2iPiaPib/ J2i b«P where the sums are over the 
final state particle momenta and the subscripts a and b refer to the spatial components 
of the momenta Pi 

• Ai?(6i,62) - the distance, in the (r^, 0) -plane, between the two 6-tagged jets, where 
AR = \/ Arj'^ + A(f)'^ and (p is the azimuthal angle 

• Ai?(6i, i) - the AR distance between the lepton and the first 6-tagged jet. 

Most of the variables used are directly measured (reconstructed) kinematic quantities 
while some are deduced variables. The choice of M^^ as a discriminating variable is obvious, 
as discussed earlier. The variable Ht is a measure of the "temperature" of the interaction; 
a large Ht is a sign of the decay of massive objects. For example, WH events would have 
larger Ht (increasing with Mfj) than the Wbb background, but smaller Ht than the tt 
background. The WH events are also more spherical than the Wbb events and have larger 
values of sphericity. The AR{b,b) is smaller for Wbb background where the 6-jets come 
mainly from g ^ bb than in WH events where the 6-jets come from the heavy object decay 
H^bb. 
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For each Higgs boson mass we trained three networks to discriminate against the main 
backgrounds Wbb, WZ and ti. The subsets of variables used to train the networks are 
hsted in Table II while in Fig ^a-c) we show the distributions of some of these variables. 
Each network has 7 input variables, 9 hidden nodes and one output node. We calcuated 
three discriminants D for every signal and background event and for every Higgs boson mass. 
Figure ^(d) shows the distributions of the discriminants for signal and background calculated 
using the network trained to discriminate between signal events, with Mh = 100 GeV/ c^, and 
the specified background. We note that all backgrounds, with the exception of WZ, are well 
separated from the signal. For Higgs boson masses close to the Z mass the WZ background 
is kinematically identical to the signal and therefore difficult to deal with. But for Higgs 
boson masses well above the Z mass the discrimination between WH and WZ improves, 
as does that between WH and the other backgrounds. (In all figures, the signal histograms 
are shaded dark while the background histograms are shaded light.) The arrows in Fig. ^(d) 
indicate the cuts applied to the discriminants. The cuts were chosen to maximize S/ a/B, 
where S and B are the signal and background counts, respectively. The cuts to suppress 
the WZ background vary from 0.18 to 0.80, increasing for higher Higgs boson masses; the 
cuts to suppress Wbb are generally about 0.8, while those for top events are in the range 
0.35 to 0.75. 

At this stage it is instructive to compare the conventional and multivariate approaches, 
to assess what has been gained by using the latter approach. In Fig. ^ we compare the signal 
efficiency vs. background efficiency (given in terms of the number of events for 1 fb~^) for an 
ensemble of possible cuts on the three discriminants (using the random grid search technique 
||17|| ) with the efficiencies obtained using the standard cuts defined by the Run II Higgs 
Workshop [^. Each dot corresponds to a particular set of cuts on the three discriminants; 
the triangular marker indicates what is achieved using the standard cuts, while the star 
indicates the results obtained from an optimal choice of cuts (which maximizes S/\fB) on 
the three network outputs. Table HI shows results for the WH channel. 
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IV. DI-LEPTON CHANNEL 



For the di-lepton channel we followed a strategy similar to that described for the single 
lepton channel. The final state signature considered is: two high Pt same flavor leptons (ee 
or from Z boson decay and two b-jets (from H ^ bb). 

The ZH events were generated using PYTHIA for Higgs boson masses of 90, 100, 110, 
120 and 130 GeV/c^. The principal backgrounds are due to ZZ, Zbb, single top and tt pro- 
duction. The Zbb background sample was generated using CompHEP, with fragmentation 
done using PYTHIA, while all other samples were generated using PYTHIA. As before, the 
SHW program was used to simulate the detector response and we assumed that two jets are 
6-tagged (one tight and one loose). The cross sections for signal and background are shown 
in Table I. The base sample was determined by the following cuts: 

• > 10 GeV/c 

• \m\ < 2 

• < 10 GeV 

• at least two jets with E^'^ > 8 GeV and \7jjet\ < 2. 

A network was trained for each Higgs boson mass and for each of the three backgrounds 
with the following variables 

_ rpbl rpb2 

• ll/rp , Jl/rp 

• Pt of the two leptons 

• M^i - invariant mass of the leptons 

• Ht 

• AR{bi,£) between the first lepton and the first 6-tagged jet. 



Distributions of these variables, as well as those of the network output, are shown in 
Fig|^(a-d). The signal distributions are for Mh=100 GeV/c^. Our results after applying 
cuts on the three network outputs, for the di-lepton channels are summarized in Table IV. 

V. MISSING TRANSVERSE ENERGY CHANNEL 

This channel has contributions from both ZH — > uubb and WH [tjvhb where (£) 
denotes the lepton that is lost. The event generation and detector simulation were carried 
out as described in the single lepton and di-lepton channel studies. The base sample was 
defined by the cuts 

• \rii\ < 2 

• $T > 10 GeV/c 

• no isolated lepton with > 10 GeV/c 

• E^"*^ < 30 GeV 

• at least two jets with E'jf* > 8 GeV and \rjjet\ < 2. 

The three networks were trained with ZH vvhb events as signal and Zhh^ ZZ and ti as 
the three backgrounds, respectively. The same networks were used to evaluate contributions 
from WH and the relevant backgrounds. We used the following variables to train the 
networks: 

• njrp , riirp 
• 

• Ht 

• 5* 
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• C- centrality {EjetsEr/ EjetsE, with E^^* > 15 GeV) 



• minimum A(/)(jet, -^j. ). 

The centrahty, C, has larger mean value (as is the case with S) for signal events than for 
backgrounds. The variable is a measure of the significance of the missing transverse 



energy. The smallest of azimuthal angles between J^'p and the jets in the event is expected 
to be smaller for Wbb, Zbb as well as high multiplicity ti events than in signal events. We 
show the distributions of the variables and neural network outputs in Figs. |^(a-d) . Again the 
signal distributions are for Mh=100 GeV/c^. The results for this channel, after optimized 
cuts on network outputs, are listed in Table V. 



In Table VI we compare the results of our multivariate analysis with those based on 
the standard cuts, while Table VII and Figs. |^ and ^ show our final results, where we have 
combined all channels. The striking feature of these results is the substantial reduction in 
integrated luminosity required to make a 5a discovery of the Higgs boson if one adopts a 
multivariate approach instead of the traditional method based on univariate cuts. In each 
of the three channels, the signal significance, which we define as S/\fB, is seen to be 20- 
60% higher from our multivariate analysis as compared to an optimal conventional analysis. 
For example, at Mh = 110 GeV/c^ we find that the required integrated luminosity for 
a 5cr observation decreases from 18.3 fb^^ to 8.5 fb~^. The results in Table VII include 
statistical errors only. The dominant systematic error will likely be due to background 
modeling. However, given the large data-sets expected by the end of Run II we can anticipate 
that a thorough experimental study of the relevant backgrounds will have been undertaken. 
Therefore, it is possible that systematic errors could, eventually, be reduced to well under 
10%. We can estimate the effect of systematic error by adding it in quadrature to the 




VI. DISCUSSION AND SUMMARY 
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statistical error. If we assume a 10% systematic error on the total background the required 
integrated luminosity for a 5cr observation increases from 8.5 fb^^ to 12.8 fb~^. 

Run II at the Tevatron with the CDF and D0 detectors will begin in early 2001. Recently 
the scope of Run II has been expanded. The goal (hope) is to collect about 15-20 fb~^ per 
experiment in the period up to and including the start of the LHC After 5 years of running, 
each experiment could see a 3a-5a signal of a neutral Higgs boson with Mh < 130 GeV/c^. 
This exciting possibility for the Tevatron is the principal motivation for the recent important 
decision to expand the scope of Run II in order to accumulate as much data as possible. 
However, even with the expanded scope a discovery may be possible only if these data are 
analyzed with the most efficient methods available, such as the one we have described in this 
paper. It is important to note that the results we have presented are for a single experiment. 
That is, our conclusion is that each experiment has the potential of making an independent 
discovery. If the experiments combine their results the discovery of a low-mass Higgs boson 
at the Tevatron might be at hand a lot sooner. 
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TABLES 



W H - 


■> ivbb 


Zti 


i^t bb 




ZH - 


> uvbb 


Mh (GeV/c"^) 


a X i?i?(fb) 


Mh (GeV/c^J 


a X i3i?(fb) 


Mh (GeV/c2) 


a X i?i?(fb) 


90 


119.0 


90 


20.3 


90 




40.6 


100 


85.4 


100 


14.8 


100 




29.6 


110 


62.3 


110 


10.9 


110 




21.8 


1 on 


45.3 


1 on 
izU 


Q OO 

o.zz 


120 




iD.4 


130 


34.1 


130 


6.25 


130 




12.5 


Backgrounds 














Whh 


3500.0 




350.0 


Zbb 




700.0 


WZ 


164.8 












tbq 


800.0 


tbq 


800.0 


tbq 




800.0 




a(fb) 




a(fb) 






a(fb) 






ZZ 


1235.0 


ZZ 




1235.0 


tb 


1000.0 


tb 


1000.0 


tb 




1000.0 


tt 


7500.0 


tt 


7500.0 


tt 




7500.0 



TABLE L Cross section times branching ratio for the WH and ZH processes we have studied, 
for various Mh [0] and for the various backgrounds. Note: For tb, tt and ZZ processes we give 
the total cross section. 



Wbb WZ tt 



rpbl 

H/rp 


rpbl 

H/rp 


rpbl 


pb2 

LL/rp 


H/rp 


H/rp 








Hj- 


Hj^ 


Hf 




Ej^ 


$T 


S 


S 


AR{bi,i) 


^R{biM) 


m 


AR{bi,b2) 



TABLE IL Single lepton channel. Variables used in training the neural networks for signals 



against specific backgrounds. 
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Mfj LreV /c 


90 


lUU 


1 1 n 
liU 


1 on 


1 on 


JNumber oi events(l rb J 
T;r/ IT 


O.DO 


o.y / 


A CI 


A A1 

4.41 




W 00 


iz.zo 


iz.4o 


CC OA 

o.o4 


a act 
y.DO 


on 1 o 


TIT" r7 

w z 


7.52 


10.32 


1.72 


1 AA 

1.00 


A AT 

0.97 


J. L 


0.51 


A A CT 

0.95 


A CO 

0.58 


A T1 

0.71 


0.9d 


J. 7, 

to 


2.4d 


f /J A 

5.40 


3.44 


r OA 

5.89 


A OO 

9.33 


tt 


O.DO 


A OA 

9.89 


7.24 


O OA 

8.39 


1 A ACi 

14.49 


Total background 


Z0.4U 


oy.U4 


lo.ol 




A K QT 


Signal significance 












S/B 


0.31 


0.23 


0.26 


0.17 


A AO 1 

0.081 


S/\/B (1 fb ^) 


1.62 


1.44 


1.11 


0.87 


0.55 


S/\/!b (2 fb~^) 


2.29 


2.04 


1.57 


1.23 


0.78 


S//B (30 fb-i) 


8.87 


7.89 


6.08 


4.77 


3.01 


Required luminosity (fb~^) 












5a 


9.5 


12.1 


20.3 


33.0 


82.6 


3a 


3.4 


4.3 


7.3 


11.9 


29.8 


1.96a (95% CL) 


1.5 


1.9 


3.1 


5.1 


12.7 


TABLE III. Single lepton channel. Results for the number of signal and background events 
(top portion of the table) for 1 fb~^ of integrated luminosity. The cuts on the network outputs were 
chosen to yield maximum significance for each Higgs boson mass, leading to different background 



counts at each mass. 



Mh (GeV/c2) 


90 


100 


110 


120 


130 


Number of events 












ZH 


1.26 


0.87 


0.79 


0.80 


0.58 


Zbb 


0.61 


0.45 


0.61 


1.50 


1.42 


ZZ 


2.04 


1.44 


1.42 


0.83 


0.31 


tt 


0.28 


0.05 


0.23 


O.li 


0.18 


Total background 


2.93 


1.94 


2.26 


2.77 


1.91 


S/B 


0.43 


0.45 


0.35 


0.29 


0.31 


s/Vb 


0.74 


0.63 


0.54 


0.48 


0.42 



TABLE IV. Di-lepton channel. Results for 1 fb"^. 
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M}{ (LreV/c j 




1 nn 
1(JU 


1 1 n 
ill) 


1 on 


1 on 


JN umber oi events 












Zti 


O.OD 


4.37 


3.53 


2.7b 


z.lo 


TIT" TT 

W n 


5.59 


3.75 


Z.79 


1 no 

1.98 


1 'rn 

1.70 


Total signal 


12.25 


8.12 


6.32 


A 1^ A 

4.74 


3.86 


Zbb 


8.12 


4.97 


4.83 


3.85 


3.92 


W 00 


zi./U 




111. DO 


0.22 




Z Z 




a. '\ A 
0.14 


z.oy 




n ccn 
U.by 


TT7" r7 


7.95 


A AC\ 

4.49 


1.99 


0.90 


0.54 


tqb 


0.63 


0.27 


0.37 


0.24 


0.29 


tb 


6.83 


2.99 


4.27 


5.12 


6.40 


tt 


5.10 


2.70 


3.00 


3.00 


4.35 


Total background 


61.57 


34.8 


27.73 


22.38 


23.62 


S/B 


0.20 


0.23 


0.23 


0.21 


0.16 


S/^/B 


1.56 


1.38 


1.20 


1.00 


0.79 


TABLE V. Missing 


; transverse energy channel. Results for 1 fb ^. 




channel 


mass 


standard 


neural 




r NN / T std 
Ij ij 




(GeV) 


cuts 


net 




(lor 5cr obsv.j 


e + ^T +bb 


100 


0.98 


1.44 




0.46 




110 


0.69 


1.11 




0.39 




120 


0.58 


0.87 




U.44 




130 


0.44 


0.55 




0.64 


$T +bb 


100 


1.09 


1.38 




0.62 




110 


0.85 


1.20 




0.50 




120 


0.67 


1.00 




0.49 




130 


0.54 


0.78 




0.47 


£+i-bb 


100 


0.48 


0.63 




0.58 




110 


0.40 


0.52 




0.59 




120 


0.40 


0.48 




0.69 




130 


0.33 


0.42 




0.61 



TABLE VL Comparison of S/\/B achievable with conventional and neural networks cuts. 
Shown in the last column are the ratios of integrated luminosity required in the multivariate 
analysis to that required in the conventional analysis for a 5(7 observation. 
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]\/r fCoXT /r2\ 


on 
yu 


1 nn 

lUU 


1 1 n 


1 on 


ioU 


0/ /t~> /-I n — 1\ 

b/V-D (1 lb 


2.4 


2.1 


1.7 


1.4 


1.0 


b/y B (2 lb j 


6.6 


3.0 


ZA 


2.0 


1.5 


S/v^ (30 fb~^) 


12.9 


11.5 


9.4 


7.7 


5.7 


Required luminosity 












5(7 (Conventional) 


7.5 


10.5 


18.3 


26.6 


42.2 


5cj (NN) 


4.5 


5.7 


8.5 


12.6 


22.7 


3a (NN) 


1.6 


2.1 


3.0 


4.5 


8.2 


95% CL (NN) 


0.7 


0.9 


1.3 


1.9 


3.5 


TABLE VII. Combined results of all three channels. We have simply added the signal counts 
and background counts from all three channels to get the total expected signal and background, 



respectively. 
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FIG. 1. The correlation between the W boson mass and the top quark mass as predicted by 
the standard model, for various possible values of the Higgs boson mass. (Each line corresponds 
to the mass value shown.) Also shown are the 68% CL contours from direct (dashed contour) and 
indirect (solid contour) measurements of the W boson and top quark mass. From Ref. 
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FIG. 2. Cross sections for various Higgs production processes in pp collisions at ^/s = 2 TeV 
as a function of Higgs boson mass. From Ref. 
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FIG. 3. Distributions of some of the variables used in the NN analysis for WH {Mh=WO 
GeV/c^) signal (heavily shaded) and backgrounds (lightly shaded) (a) WH vs. Wbb, (b) WH vs. 
WZ, (c) WH vs. tt. In (d) we compare the neural network output distributions for signal and 
various backgrounds. The arrows indicate the cuts. 
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• neural network cuts 
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FIG. 4. Single lepton channel. The number of signal events vs. number of background events 
for 1 fb^^ using various combination of cuts on the three neural network outputs. The standard 
cuts are optimized based on studies done in the Higgs working group using conventional methods. 
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(a) ZH vs. Zbb 
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(c) ZH vs. tt 
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FIG. 5. Di-lepton channel. Distributions of variables used in training the neural networks for 
signal (with Mh = 100 GeV/c^) and different backgrounds and the results of the trained networks, 
(a) Signal vs. Zbb background; (b) signal vs. ZZ background; (c) signal vs. tt background and 
(d) distributions of neural network outputs for networks trained using signal vs. the backgrounds 
ZZ, Zbb and tt. The signal histograms are heavily shaded. The arrows indicate the cuts. 
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(a) ZH vs. Zbb (b) ZH vs.ZZ 




Hji(GeV) M-(GeV/c^) H^GeV) Mf^(GeV/c') 




Hji(GeV) M-^(GeV/c^) 




Sphericity E/^E/J (GeV'^^) it tqb 



FIG. 6. Missing transverse energy channel. Distributions of variables used in training the 
neural networks for signal (with Mh = 100 GeV/c^) and different backgrounds, together with 
distributions of network outputs, (a) Signal vs. Zbb; (b) signal vs. ZZ; (c) signal vs. tt and (d) 
distributions of neural network outputs for networks trained using signal vs. the backgrounds ZZ, 
Zbb and tt. The signal histograms are heavily shaded. The arrows indicate the cuts. 
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FIG. 7. Required integrated luminosity, with all channels combined, at 5a, 3a and 1.96cj (95% 
C. L.) significance, for NN analysis. 
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FIG. 8. Comparison of required integrated luminosity for a 5a observation with all channels 
combined for NN and standard cuts. The luminosities given are for a single Tevatron experiment, 
as in the previous plots. For a given integrated luminosity the NN analysis provides a much higher 
discovery reach in mass. 
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